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ABSTRACT Experimental evolution studies, coupled with new advances in DNA sequencing technology, KEYWORDS 
have become a powerful tool for exploring how populations respond to selection at the genomic level. experimental 
Recent experiments in microbes typically have found evidence for multiple novel mutations, which are evolution 
usually fixed. In contrast, in animal model systems, evolutionary responses seem to involve more modest selective sweep 
changes in the frequencies of pre-existing alleles, probably because these populations outcross and are compensatory 
usually initialized with greater levels of standing variation. In this experiment, I used whole-genome adaptation 
resequencing to estimate allele frequencies and look for novel substitutions in experimentally evolved sex 
populations of Caenorhabditis elegans. These populations were founded with a fixed pair of deleterious determination 
mutations introgressed into multiple wild genetic backgrounds and allowed to evolve for 50 generations tra-2 
with a mixed mating system. There is evidence for some recombination between ancestral haplotypes, but 
selective sweeps seem to have resulted in the fixation of large chromosomal segments throughout most of 
the genome. In addition, a few new mutations were detected. Simulations suggest that strong selection and 
low outcrossing rates are likely explanations for the observed outcomes, consistent with earlier work show- 
ing large fitness increases in these populations over 50 generations. These results also show clear parallels 
to population genetic patterns in C. elegans in nature: recent selective sweeps, high linkage disequilibrium, 
and low effective recombination rates. Thus, the genomic consequences of selection depend heavily on the 
biology of the organism in question, including its mating system and levels of genetic variation. 



Experimental evolution has long been a powerful tool for biologists 
(e.g., Malmberg 1977; Rose 1984; Dodd 1989; Lenski et al. 1991; Rice 
1992). Only recentiy, however, has the widespread adoption of high- 
throughput DNA sequencing technology made "evolve-and-resequence" 
a feasible experimental strategy (e.g., Burke et al 2010; Turner et al. 2011; 
Remolina et al. 2012). In this approach, populations of a model organism 
are reared in the laboratory for a number of generations, usually using an 
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experimentally manipulated selective agent. After experimental evolu- 
tion, whole-genome resequencing is used to identify genetic loci re- 
sponding to selection. This experimental design has the potential to 
shed light on a number of important evolutionary questions, including 
the repeatability of the genetic basis of evolutionary change and how 
selection affects variation at linked sites (Kawecki et al. 2012; Gray and 
Cutter 2014). 

To date, evolve-and-resequence studies have largely fallen along 
two ends of a continuum. On one end, populations are initiated with 
a single isogenic progenitor, relying entirely on de novo mutations to 
drive adaptation, and typically lacking genetic recombination. This 
design is typical of microbial experiments (e.g., Araya et al. 2010; 
Charusanti et al. 2010). Such studies, in general, have found a handful 
of novel substitutions in adapted populations, with mutations in sim- 
ilar genes or pathways often occurring convergently in independent 
experimental replicates (Araya et al. 2010; Charusanti et al. 2010; 
Comas et al. 2011; Dettman et al. 2012). 

On the other end of this continuum are studies initialized with 
a considerable degree of standing genetic variation and with obligately 
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outcrossing populations, typically using animal systems (to date, 
mostly Drosophila melanogaster). These organisms typically also have 
larger genomes, potentially offering more mutational targets for ad- 
aptation, although the population mutation rate (N e |ji) often is smaller 
in absolute magnitude because of the population sizes used. Although 
the repeatability of genetic changes is difficult to assess in these studies 
because they often identify candidate loci by looking for shared pat- 
terns across replicates, there usually are multiple loci showing a com- 
mon signal (Burke et al. 2010; Turner et al. 2011; Zhou et al. 2011; 
Orozco-Terwengel et al. 2012; Remolina et al. 2012; Turner and Miller 
2012; Tobler et al. 2014), suggesting that some degree of repeatability 
is common. However, in contrast to microbial studies, animal studies 
typically find polygenic responses involving ancestral alleles rather 
than novel mutations and fewer fixed sites. One notable exception 
among these animal studies involved the nematode worm Caenorhab- 
ditis elegans, in which a handful of novel substitutions became fixed, 
likely due to selective sweeps (Estes and Lynch 2003; Denver et al. 
2010). However, this study adopted a setup similar to microbial 
experiments, involving isogenic, self-fertilizing populations. Thus, 
the differences in outcomes between microbial and animal studies 
are likely explained by both their experimental designs and the biology 
of the study organisms. 

Although the two endpoints of this continuum may provide 
reasonable laboratory models for many organisms, there are of course 
others that fall somewhere in between. C. elegans, for instance, is 
androdioecious, with primarily self-fertilizing hermaphrodites and 
outcrossing males. In natural populations, male frequencies and out- 
crossing rates are low (Barriere and Felix 2005, 2007). Genetic di- 
versity in this species is also low, and there are large blocks of 
strong linkage disequilibrium, even extending across separate chro- 
mosomes (Rockman and Kruglyak 2009; Andersen et al. 2012). Pat- 
terns of genetic variation in natural populations provide evidence for 
recent and rapid selective sweeps (Andersen et al. 2012). In addition, 
effective population outcrossing rates estimated from population ge- 
netic data are orders of magnitude lower than those estimated directly 
from the occurrence of males and heterozygotes in nature, suggesting 
outbreeding depression or selection against recombinant genotypes, 
probably due to epistasis (Barriere and Felix 2007; Dolgin et al. 2007; 
Rockman and Kruglyak 2009). (However, some laboratory studies 
suggest that outcrossing may facilitate adaptation to novel or changing 
environments, at least in some situations; Morran et al. 2009; 2011; 
Teotonio et al. 2012). Based on these observations, selective sweeps 
extending across large chromosomal segments also might be expected 
to occur in androdioecious experimental populations of C. elegans, 
even in the presence of standing variation. If these sweeps involve 
ancestral variants rather than novel mutations, then repeatable out- 
comes are also predicted. 

In this work, I follow up on an earlier experimental evolution study 
in C. elegans to test these predictions. These populations were fixed for 
a pair of mutations rendering sex determination temperature-sensitive 
but introgressed into multiple wild genetic backgrounds, so they were 
founded with standing genetic variation (Chandler et al. 2012). They 
also exhibited a mixed mating system, with some degree of facultative 
outcrossing (at least initially), allowing for some recombination be- 
tween divergent haplotypes. Ten populations were evolved at each of 
two intermediate temperatures for 50 generations. In these environ- 
ments, the temperature-sensitive mutant genotype exhibits dramati- 
cally deleterious effects: fecundity is reduced by up to 75%, with as 
many as half the worms in the founder populations being infertile or 
sterile (Chandler et al. 2009; 2012). Ancestral worms at these temper- 
atures also exhibit obvious intersexuality in their tail morphology, 



which is sexually dimorphic in this species. After 50 generations of 
experimental evolution, however, fertility rates and tail morphology 
have recovered in some populations to near wild-type levels, suggest- 
ing that selection has favored compensatory alleles suppressing these 
deleterious mutant phenotypes. However, transcript levels of five key 
sex- determining genes cannot explain this fitness recovery: there are 
clear expression differences between wild-type and ancestral mutant 
worms, but expression patterns in evolved populations are overall 
much more similar to ancestral mutant worms than to wild-type 
worms (Chandler et al. 2012). This finding suggests that other changes 
underlie these apparently compensatory adaptations. Genetic variants 
segregating among wild isolates are known to modulate the pheno- 
typic effects of these mutations (Chandler 2010), and some of these 
modifier alleles may have become fixed; new mutations in these pop- 
ulations might contribute as well. 

Here, I use population-level, whole-genome resequencing in these 
populations to address the following questions: (i) to what degree have 
ancestral alleles become fixed, and how much recombination has 
occurred among these ancestral haplotypes? (ii) How similar are 
genomic patterns across experimental replicates? (iii) Is there evidence 
of any novel mutations? (iv) What selective scenarios best explain 
these outcomes? The results suggest that selection played an important 
role in generating the observed genomic outcomes, which here highly 
convergent across replicates and show similarities to evolve-and- 
resequence studies of isogenic microbial populations as well as 
outcrossing, genetically variable animal model systems. Moreover, 
the results also show clear parallels to patterns of genetic variation in 
natural populations of C. elegans — namely, strong selective sweeps 
with limited recombination among ancestral haplotypes. 

MATERIALS AND METHODS 
Genome resequencing and analysis 

A detailed description of the experimental evolution populations has 
been previously published (Chandler et al. 2012). In summary, the 
C. elegans strain CB5362 tra-2(ar221)U; xol-l(y9)X carries two muta- 
tions (in an N2 genetic background) that together result in an entirely 
XX population. These worms are hermaphrodites at cooler temper- 
atures (<13°) and males at warmer temperatures (>24°). At the in- 
termediate temperatures used in this experiment (16° and 18°), 
ancestral populations are a mix of males and hermaphrodites but with 
a high frequency of intersex phenotypes and strongly reduced fitness 
(Chandler et al. 2009, 2012). These mutations were introgressed into 
four additional wild-type genetic backgrounds (CB4856, AB1, MY2, 
and JU258; Chandler 2010; Chandler et al. 2012). Populations were 
founded with 10 hermaphrodites and 12 males with each ancestral 
genetic background; 10 replicate populations were evolved at 16° and 
another 10 at 18°. At generation 50 (g50), evolved populations dis- 
played substantial phenotypic recovery (Chandler et al. 2012). Three 
populations showing strong evolutionary responses (18EE1, 18EE2, 
and 16EE6) were chosen for sequencing. In addition, I sequenced 
an ancestor sample consisting of DNA from pooled gl worms of all 
three replicates. 

Frozen samples of each population were thawed and grown at 
a permissive temperature (13°) for three to four generations before 
DNA isolation. For each population, hundreds of worms were washed 
en masse from Petri dishes in sterile water. DNA was extracted from 
each sample using a QIAGEN DNeasy Blood & Tissue Kit (QIAGEN, 
Valencia, CA) following the manufacturer's instructions. The pooled 
ancestor sample and three evolved samples were submitted to the SUNY 
University at Buffalo Next-Generation Sequencing and Expression 
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Analysis Core for sequencing on an Illumina HiSequation 2000 with 
single-end, 51-bp reads (1x51). In addition, I downloaded sequence 
data for wild-type strains CB4856, AB1, MY2, and JU258 from the 
NCBI Sequence Read Archive (accession numbers SRX219150, 
SRX128707, SRX218993, SRX218998, and SRX218971); these were 
paired-end reads (2x77 and 2x101). I mapped sequence reads to the 
reference C. elegans genome (WormBase release WS235) using 
BWA v0.7.5a (Li and Durbin 2009), allowing up to two mis- 
matches for the 1x51 datasets, and using the default parameters 
with the BWA-MEM mapping algorithm for the other datasets. I 
called single-nucleotide polymorphisms (SNPs) using SAM- 
Tools vO.1.18 (Li et al. 2009), excluding sites with coverage 
greater than lOOx to avoid calling false SNPs in paralogous repeats 
and restricted further analyses to SNPs with a minimum quality 
score of 30. 

First, I wished to assess, for each position in the genome, which (if 
any) of the ancestral genetic backgrounds had become fixed. To do so, 
I identified SNP alleles unique to each of the five founder genetic 
backgrounds at nucleotide sites with at least lOx coverage in every 
background by using a custom R script. I then divided the genome 
into nonoverlapping 200-kb windows and calculated the average fre- 
quency of the alleles specific to each ancestral genetic background in 
each window, in each population. 

Next, I looked for evidence of fixed de novo mutations, by identi- 
fying nucleotide substitutions that were nearly fixed (>90% fre- 
quency) in at least one evolved line but absent from the pooled 
ancestor and all the founder wild-type genetic backgrounds (and sup- 
ported by at least lOx coverage in all cases). 

Simulation analyses 

I used simulations to assess whether the observed outcomes were 
consistent with various levels of selection, numbers of fitness quantitative 
trait loci (QTL), and outcrossing rates. The source code for the 
simulation is archived on Dryad (DOI: 10.5061/dryad.603d2). 

Simulations were written to reflect the design of the evolution 
experiment. In each run of the simulation, 20 populations were 
evolved for 50 generations each. The three populations with the 
greatest average fitness were selected and used to compute two test 
statistics: (i) s, the proportion of genomic positions in which the same 
allele was "dominant" (had the highest frequency) in all three pop- 
ulations; and (ii) f, the average proportion of sites fixed (in which the 
dominant allele has a frequency of at least 0.9) in each population. The 
expected distribution and 95% confidence intervals for each of these 
two statistics were computed using 200 replicate simulations for each 
set of parameter values (number of QTL, outcrossing rate, etc.). The 
observed values of these statistics (generated from the sequence data) 
were then compared with the expected distributions to generate a 
p-value for each scenario. A small p-value (< 0.05) for a given sce- 
nario would indicate a poor fit for the observed data, whereas a larger 
p-value for both statistics would indicate that the parameter values are 
consistent with the experimental results. 

Each simulated population was initialized with 10 hermaphrodite 
and 12 male worms of each founder genetic background (N2, CB4856, 
AB1, MY2, and JU258), and subsequently expanded to a population 
size of 1000, with nonoverlapping generations. Each worm in the 
simulation carried a diploid XX genome, composed of six chromo- 
some pairs of 69—105 "windows" each (corresponding to the 200-kb 
windows used in the analysis of the actual evolved genomes). Alter- 
native population sizes of 200, 500, and 2000 also were tested and gave 
similar results (Supporting Information, Table S2). 



For each generation n, the relative fitness of each simulated worm 
in the population was first calculated based on its genotype. A value of 
1.0 was assigned as the default fitness. This fitness value was multiplied 
by the fitness assigned to each of the QTL alleles carried by the worm 
(see below for details on modeling QTL). The worm's phenotypic sex 
was then chosen randomly based on the user-specified outcrossing rate. 

Next, offspring genotypes for generation n+1 were generated one- 
by-one from the worm genotypes in generation n. To generate each 
offspring genotype, the first parent was selected randomly, weighted 
by relative fitness. If that parent was a hermaphrodite, the offspring 
genotype was generated by self-fertilization; if it was a male, a her- 
maphrodite mother was selected randomly, again weighted by relative 
fitness. Gamete genotypes were then generated by simulating meiosis. 
Each tetrad in the simulation underwent a single crossover during 
meiosis, mirroring the single crossover per chromosome pair observed 
in C. elegans (Hillers and Villeneuve 2003). After simulating crossing 
over, full gamete genotypes were obtained by randomly selecting a sin- 
gle chromatid for each of the six chromosome pairs making up the 
parental genotype. The complete offspring genotype was finally con- 
structed by uniting the two gamete genotypes generated from the 
parent worms. This process was repeated until enough offspring geno- 
types had been generated to populate generation n+1. Again, in each 
replicate run of the simulation, this entire cycle was repeated for 50 
generations in each of 20 populations, and the test statistics were 
computed for each of 200 replicates. 

I ran the simulation with various sets of parameter values to assess 
which could best explain the observed results. Although the 
simulation was implemented in C++ for speed, the number of 
variables precluded an exhaustive search of parameter space for an 
optimal model fit. Therefore, I searched for parameter values that 
could produce a reasonable fit as measured by the two test statistics, 
focusing on three key factors: outcrossing rate, maximum relative 
fitness, and number and distribution of fitness QTL. I tested five 
outcrossing rate scenarios: constant rates of 0.01, 0.05, and 0.10, and 
two variable rates (initially 0.05 or 0.10, dropping to 0.01 at 
generation 25). 

QTL were modeled by first selecting a value for the maximum 
relative fitness; five possible values for maximum relative fitness were 
tested: 1.1, 1.5, 2, 4, and 10. Then, for each maximum relative fitness 
value, various scenarios for the number and distribution of QTL were 
tested, based on the observed genotypes (see File SI for details on how 
models describing the locations of QTL were generated based on 
observed outcomes). Finally, fitness values for QTL alleles were 
assigned such that the optimal genotype (i.e., a simulated worm 
homozygous for all high-fitness alleles) would have the maximum 
relative fitness assigned earlier. For example, if the maximum rela- 
tive fitness was 2.0, with two diploid QTL with equal effects, then the 
fitness value for each QTL would be 2 1/4 ~ 1.189; the fitness of the 
optimal genotype (say, AABB if A and B represent the high-fitness 
alleles at each locus) would therefore be 1.189 4 ~ 2. I also tested 
a null model with no fitness QTL (i.e., neutral evolution). 

Finally, in addition to the test statistics described previously, the 
simulation provided genome-wide allelic distributions for simu- 
lated evolved populations, so that the simulated outcomes could 
be qualitatively compared to the observed outcomes. 

RESULTS 

Genomic patterns in experimental populations 

I obtained a total of -140 million single- end, 51-bp reads across the 
ancestral and evolved samples, resulting in at least ~9x genome-wide 
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Table 1 Candidate novel mutations in evolved lines 



Chrom. 


Pos. 


Ref. 


Alt. 


Present in 


Info. 


Ill 


4,895,774 


A 


C 


16EE6 


Leu— >Arg at amino 












acid 862 in nhl-2 


X 


8,801,613 


T 


A 


18EE1 


Intergenic 


X 


16,501,330 


C 


T 


18EE2 


Intergenic 



These candidate mutations were identified by searching for novel SNP alleles 
fixed in at least one evolved line (supported by at least 10x coverage) and 
absent from all ancestral sequences (again supported by at least 10x coverage). 
Chrom., chromosome; Pos., nucleotide position; Ref, reference allele; Alt., al- 
ternate allele; SNP, single-nucleotide polymorphism. 



coverage for each population (Table SI). By using the same analysis 
pipeline with the publicly available sequence data for strains CB4856, 
AB1, MY2, and JU258, 1 obtained ~30x coverage for each of the wild- 
type ancestral genetic backgrounds. Raw sequence files generated in 
this study are available from the NCBI Sequence Read Archive under 
accession numbers SRX658438, SRX658572, SRX658573, and SRX658574. 
Processed data and analysis scripts are archived in Dryad (DOI: 10.5061/ 
dryad.603d2). 

Approximately 558,000 putative SNPs were detected, for an 
average density of one SNP every 180 bp. Of these, approximately 
130,000 were specific to a single genetic background, for an average 
density of one diagnostic SNP per 750 bp, and 330 diagnostic SNPs 
per 200-kb window. Thus, power to detect heterozygosity within a 
given genomic window is fairly good because of the large number of 
diagnostic SNPs in each window, even though detecting heterozygos- 
ity at any individual SNP is low, given the modest sequencing depth. 
Indeed, substantial polymorphism was seen in the ancestor sample 
and on most of chromosome I in 16EE6. On the other hand, fixation 
or near-fixation of a single ancestral genetic background occurred 
throughout most of the genome in all the evolved lines; moreover, 
patterns of fixation were qualitatively similar in all three evolved lines 
(Figure 1). The large segments of contiguous ancestral backgrounds 
fixed on each chromosome in the evolved lines (most spanning several 
megabases) suggest that the genomic window size of 200 kb used here 
was small enough to capture the recombination events that have 
occurred in these populations (i.e., a single window is extremely un- 
likely to span more than one recombination breakpoint). 

I found evidence for three fixed novel mutations in total: one 
coding mutation in line 16EE6 (a missense mutation in nhl-2), and 
one noncoding mutation each in lines 18EE1 and 18EE2 (Table 1). 
The use of Bowtie2 (Langmead and Salzberg 2012) as an alternative 
read aligner yielded nearly identical results for overall genomic com- 
position and recovered the same three novel substitutions. Further- 
more, all three mutations were confirmed using direct Sanger 
sequencing of PCR products (see File S2 and Table S3 for details). 
For the ancestral population, chromatograms only showed evidence 
of the reference allele, while only the mutant alleles were present in 
the respective evolved populations (Figure S2). These results suggest 
that the mutant alleles are indeed absent from the ancestral popu- 
lation and fixed in the evolved populations, or at least that the 
frequency of the minor allele is below the detection limit in each 
case. 

Simulation results 

Simulation analyses strongly rejected a null model of neutral evolution 
both quantitatively (P < 0.01; Table 2) and qualitatively (Figure 1). 
Models incorporating selection and low levels of outcrossing (1%), on 
the other hand, were sufficient to explain the shared patterns of fix- 



ation across the three replicate lines (Table 2), and gave genomic 
patterns qualitatively similar to those observed (Figure 1 and Figure 
SI). Results from simulations with alternative population sizes (200, 
500, and 2000) were consistent overall (Table S2) in rejecting neutral 
evolution and supporting modest to large fitness gains for evolved 
worms, although greater outcrossing rates (5%) were also consistent 
with observed results in these cases. 

DISCUSSION 
Patterns of fixation 

Earlier work has shown strong phenotypic changes in these C. elegans 
populations over the course of 50 generations of experimental evolu- 
tion. In particular, although the tra-2(ar221); xol-l(y9) genotype these 
populations were founded with results in high rates of infertility and 
intersexuality at the rearing temperatures used in this experiment, 
both of these mutant phenotypes are largely suppressed in the evolved 
lines (Chandler et al 2012). The whole-genome resequencing data 
generated on this study shed light on the genomic changes that have 
accompanied this evolutionary response. 

The high degree of fixation and similar allelic compositions of these 
evolved lines supports an important role for selection in generating 
the observed genomic outcomes. Indeed, simulations strongly reject 
neutral explanations for the observed results, instead supporting a role 
for selection (Figure 1 and Table 2). These simulations also suggest 
that the outcrossing rate in these populations was probably low (1-5%) 
and that the overall relative fitness has at least doubled in these pop- 
ulations, consistent with the strongly deleterious effects of the tra-2 
(ar221); xol-l(y9) genotype and the large fitness recovery observed in 
the actual populations after 50 generations (Chandler et al. 2012). 
However, the simulations are not particularly informative with respect 
to the number of QTL; all four sets of QTL models produced similar 
outcomes for a given outcrossing rate and maximum fitness. Neverthe- 
less, multiple loci are probably involved, as different genetic backgrounds 
became fixed in different parts of the genome in a repeatable manner 
(Figure 1). 

In some ways, these results are similar to evolve-and-resequence 
studies in other animal models (Burke et al. 2010; Turner et al. 2011; 
Zhou et al. 201 1; Remolina et al. 2012; Turner and Miller 2012; Tobler 
et al. 2014). For example, ancestral alleles, rather than new mutations, 
seem to have played an important role in the adaptive responses seen 
in these populations, based on the similarity of patterns of fixation 
across replicate populations (Figure 1). Indeed, evidence of just a single 
novel substitution was found in each population (Table 1), fewer than 
most studies involving microbes and one other study in C. elegans 
(Denver et al. 2010), although the sequencing approach used here 
could have missed some, such as substitutions in low-coverage regions 
or structural mutations. This paucity of new mutations may be 
explained by the presence of standing variation in the ancestral pop- 
ulations, which likely provided a rich pool of alleles for selection to act 
upon immediately. Indeed, a QTL mapping study showed that these 
genetic backgrounds harbor multiple polymorphisms modulating the 
effects of the tra-2(ar221); xol-l(y9) genotype used in this experiment 
(Chandler 2010). However, these populations also evolved for fewer 
generations than most microbial experimental evolution studies (e.g., 
Araya et al. 2010; Charusanti et al. 2010), which may have limited the 
opportunity for new mutations to accumulate. 

In other aspects, on the other hand, these results resemble those of 
microbial studies. For instance, near-complete fixation, rather than 
moderate changes in allele frequencies, occurred throughout almost 
the entire genome in these populations (Figure 1). In addition, although 
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some recombination among ancestral haplotypes did occur, entire 
chromosomal segments became fixed, suggesting that outcrossing 
rates in these populations were low, which was supported by simu- 
lations (Table 2), although segregating inversions could potentially 
explain this result too (Tobler et al. 2014). 

These populations also display interesting differences from other 
studies looking at genetic diversity in experimentally evolved C. elegans 
populations. For instance, Chelo and Teotonio (2013) found much 
greater levels of diversity after 100 generations of experimental evolu- 
tion in a novel environment. These differences can likely be explained 
by several factors. First, outcrossing rates in those populations were 
much higher (-20%; Teotonio et al. 2012). The lower outcrossing 
rates in my populations were likely caused by the mutant tra-2 
(ar221); xol-l(y>9) genotype, resulting in low- fitness males at these 
temperatures, and evolved populations are almost exclusively her- 
maphrodites (Chandler et al. 2012). In addition, the populations 
used by Chelo and Teotonio (2013) were initialized with variation 
from 16 wild genetic isolates using a more involved funnel cross, 
whereas my populations were established simply by placing several 
worms of each of five wild genetic backgrounds together in a single 
cross. The more controlled crossing scheme in their study therefore 
probably generated a much larger number of recombinant haplo- 
types at the very start of the experiment. Finally, the populations of 
Chelo and Teotonio (2013) also probably experienced weaker selec- 
tion (selection coefficients around 2%, whereas fitness has probably 
at least doubled in this study). The stronger selection in this study is 
probably attributable to the deleterious effects of the mutant geno- 
type fixed in these populations. 

On the other hand, there are intriguing similarities to patterns of 
diversity seen in natural populations of C. elegans. Recent studies 
have found small numbers of haplotypes and strong linkage dis- 
equilibrium in wild isolates of C. elegans (Rockman and Kruglyak 
2009; Andersen et al. 2012). These observations provide evidence 
for recent global selective sweeps, that outcrossing rates are low 
(Barriere and Felix 2005; 2007), and that selection against recombi- 
nant haplotypes drives effective population outcrossing rates even 
lower (Barriere and Felix 2007; Rockman and Kruglyak 2009). 
Similarly, the populations in this study showed evidence of low re- 
combination and a selective sweep leading to the nearly complete 
fixation of a single haplotype in each population within 50 gener- 
ations. Although the loss of diversity in these experimental popula- 
tions is even more extreme than that seen in nature, the difference 
may be partially explained by the lack of metapopulation dynamics 
thought to characterize wild C. elegans populations (Barriere and 
Felix 2007); migration could reintroduce diversity into local subpo- 
pulations. Finally, the evolution of highly similar recombinant hap- 
lotypes in independent and completely isolated populations in this 
study suggests that shared haplotypes observed in distant localities in 
nature may not necessarily have arisen through migration, but per- 
haps instead through convergence. 

In addition to the widespread fixation of ancestral alleles, one fixed 
novel mutation was detected in each population. The functional 
significance of these mutations is unclear. One is a missense mutation 
in nhl-2, which is involved in miRNA regulation (Hammell et al. 
2009). A single intergenic substitution occurred in each of the other 
two lines, but these are not particularly close to any genes with obvi- 
ous functions in sex determination or fertility, so whether they may 
influence fitness through effects on gene regulation remains to be 
tested. Alternatively, these mutations may be neutral, having simply 
hitchhiked on the recombinant chromosome that became fixed in 
each population. 



Selective pressures 

Simulations strongly suggest that selection led to the observed genomic 
patterns, but they do not reveal the precise nature of those selective 
pressures. One possibility is that the multilocus genotypes observed 
in the evolved lines are the result of selection against deleterious 
combinations of epistatically interacting alleles segregating among 
the founder genetic backgrounds. Indeed, such incompatibilities are 
probably common (Corbett-Detig et al. 2014), and at least one good 
example is known in C. elegans (Seidel et al. 2008). Moreover, there 
is some evidence for selection against outcrossing in C. elegans in 
nature (Barriere and Felix 2007; Rockman and Kruglyak 2009). Al- 
though this scenario may explain some of the loci that became fixed 
in the evolved lines, I suspect that at least some of the fixed alleles in 
these populations were selected to compensate for the tra-2(ar221); 
xol-l(y9) mutant genotype fixed in these populations. First, these 
mutations are highly deleterious and were fixed, so I expect selection 
to suppress their effects to be quite strong. Indeed, the nature of the 
phenotypic recovery seen in these lines — e.g., the suppression of 
intersexed phenotypes — suggests that some of the increase in fitness 
in these populations is attributable to compensatory adaptation. 
Such compensatory evolution is probably common, as evidenced 
by a number of studies of both laboratory and natural systems (e.g., 
(Moore et al. 2000; Estes and Lynch 2003; Kulathinal et al. 2004; 
Pischedda and Chippindale 2005; Harcombe et al. 2009; Stoebel 
et al. 2009). 

In addition, the "ancestor" sequence data provide some suggestive 
clues. This pooled "ancestor" consisted of gl worms that had under- 
gone one generation of outcrossing before being frozen, thawed, and 
subsequently recovering for three to four generations. Thus, there was 
some, albeit limited, opportunity for selection to act before DNA 
isolation. During this process, worms were cultured at a permissive 
temperature (13°), at which the deleterious effects of these mutations 
are largely absent, so selection to compensate for their effects should 
have been weak, providing something of a negative control. Consistent 
with the idea that selection for compensatory adaptation explains 
allele frequencies in the evolved lines, genomic patterns in the evolved 
lines are distinct from those of the "ancestral" sample. These differ- 
ences should be interpreted cautiously because of the small number of 
generations "ancestral" worms were reared at the permissive temper- 
ature and because of the potential for genetic drift in those popula- 
tions after thawing. Nevertheless, they provide a possible hint that 
selection acts differently when the tra-2(ar221); xol-l(y9) genotype's 
deleterious effects are expressed, suggesting compensatory adaptation. 

Unfortunately, generating a true negative control for this experi- 
ment would be difficult. For instance, populations seeded with the 
same wild genetic backgrounds but lacking the mutations would 
exhibit very different outcrossing rates, as males are typically main- 
tained at relatively high frequencies in genetically mixed populations of 
C. elegans (Anderson et al. 2010; Teotonio et al. 2012). Thus, compar- 
isons of genomic patterns between such wild-type "controls" and these 
experimental populations would be difficult to interpret. 

Fitness recovery in remaining populations 
and conclusions 

The three evolved lines investigated here were chosen from a larger set 
of 20 because of their especially strong phenotypic recovery. But if the 
fitness increase in these lines was driven largely by pre-existing genetic 
variants, why was the evolutionary response in the other seventeen 
lines noticeably weaker? First, it is possible that the few detected novel 
substitutions did actually contribute to the strong response seen in 
these three lines, or that this strong recovery is caused by additional 
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mutations that were undetected due to low sequencing coverage. An 
alternative explanation is that the stochastic loss of beneficial ancestral 
alleles early on in the experiment may have prevented most of the 
evolved lines from attaining such a robust recovery. Deeper sequenc- 
ing of all the evolved lines, especially using paired-end sequence reads 
to help identify structural mutations, would help resolve this issue. 

Most other evolve-and-resequence studies have focused on pop- 
ulations sitting at two ends of a continuum: either asexual, initially 
isogenic populations, or genetically variable, obligately outcrossing 
systems. In contrast, the populations in this study were seeded with 
a modest amount of genetic variation and exhibited a mixed mating 
system, similar to wild populations of C. elegans. As one might expect, 
genomic outcomes also were intermediate: although there was some 
recombination, ancestral variants became fixed in large genomic 
blocks, and there was evidence for fewer novel substitutions than in 
most microbial studies. This result is consistent with the low out- 
crossing rates and strong selective sweeps seen in natural popula- 
tions of C. elegans. These results thus demonstrate the utility of the 
evolve-and-resequence approach in exploring how patterns of geno- 
mic diversity are shaped by factors such as selection, drift, mutation, 
and mating system. 
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